# Speech-Text Dual Modality

Ichigo Llama3.1 S Instruct V0.3 Phase 3

A model in the Ichigo-llama3s series that focuses on better handling of ambiguous inputs and multi-turn dialogues. It supports both audio and text inputs.

Text-to-Audio English

Ichigo Llama3.1 S Base V0.3

Llama3-S is a multimodal language model that accepts both audio and text inputs. It is built on the Llama-3 architecture, with a focus on stronger speech understanding.

Audio-to-Text English

Ichigo Llama3.1 S Base V0.3

A model in the Llama3-S series, a family of multimodal language models developed by Homebrew Research. It natively understands both audio and text inputs, extending the Llama-3 architecture with speech understanding.

Audio-to-Text English

© 2025 AIbase